MATHEMATICAL CONTROL 
THEORY SOLUTION OF AN INTERACTIVE ACCOUNTING 
FLOWS MODEL* 


Yuji Biri 
and 


Gerald L. Thompson 


Graduate School of Industrial Administration 
Carnegie-Mellon University 


ABSTRACT 


In this paper we have applied the mathematical control theory to the accounting network 
flows, where the flow rates are constrained by linear inequalities. The optimal control policy 
is of the “generalized bang-bang” variety which is obtained by solving at each instant in 
time a linear programming problem whose objective function parameters are determined 
by the ‘‘switching function” which is derived from the Hamiltonian function. The interpre- 
tation of the adjoint variables of the control problem and the dual evaluators of the linear 
programming problem demonstrates an interesting interaction of the cross section phase 
of the problem, which is characterized by linear programming, and the dynamic phase of 
the problem, which is characterized by control theory. 


1. INTRODUCTION 


This paper is a sequel to our previous paper [1] which dealt with a wheat trading model in which 
only one product is to be bought and sold subject to maximum purchase and sales limits, when the price 
variation over time is known with certainty. 

In this paper, we shall extend the applications of mathematical control theory to multiple product 
cases where more than one product is to be bought (produced) and sold under price certainty, subject 
to various kinds of linear constraints on purchases (production) and sales. We will use notation similar 
to that used in [1]. 

Our results are as follows: The adjoint variables have interpretations similar to those in the pre- 
vious paper [1]. The optimal control policy is of the “generalized bang-bang”’ variety which is obtained 
by solving at each instant in time a linear programming problem whose objective function parameters 
are determined by the “switching functions” of the control problem. The optimal policy then “bangs” 
from one extreme point of this linear programming problem to another as the switching functions 
change over time. A simple example is worked out to show how the optimal policy changes as prices 
change over time. 


2. THE CONTINUOUS JOINT TRADING MODEL 
We shall describe this model in terms of a trading situation, but it can easily be reinterpreted in 
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terms of a model in which there is the joint production and sale of two different products. 


Let us consider a firm that engages in trading two kinds of commodities. wheat and rice. The 
firm’s assets are of only three kinds, cash. wheat, and rice, whose balances at time ¢ are represented by 
c(t), w(t), and v(t), respectively. These stocks are measured as follows: c(t) in dollars, and w(t) 
and v(t) in bushels or in some other physical units. The price of wheat and the price of rice from time 
0 to time T are assumed to be known in advance with certainty and are given by functions pi(¢) and 
P2(t), respectively. It is also assumed that for each commodity there is only one price at any point in 
time which is applicable both for buying and selling the commodity. We shall let Pi = pi(T) and P,= 
p(T) be the terminal prices of wheat and rice, respectively. We assume that the initial assets Co= 
c(0), Wo=w(0), and Vo=v(0) are known constants. 

The firm’s objective is to buy and sell wheat and rice during the time period 0 to T so as to maximize 
the total value of its assets at time T. Hence, it always values its current inventories with the terminal 
prices in making trading decisions. Thus the total value of the assets of the firm at time t, denoted by 
x(t), is given by 


QQ) x(t) =c(t) + Piw(t) + Prv(t). 


We denote the rates of buying wheat and rice at timet by mip (t) and mz,(t), respectively, and the rates 
of selling wheat and rice at time t by mis(t) and mes(t) , respectively. They are all expressed in physical 
units of wheat and rice. We shall impose the following constraints on these variables which are assumed 


to reflect the buying and selling environments of the firm’s operations. 
(2) Aypmy <b, 

Asm, <b, 

Mp, ms = 0, 


where the matrices Ap (kX 2), As (n X2), bp (kX1), and bs (n X1) are given constant matrices, and 
m, is the column vector with components mj, and m2,, and m; is the column vector with components 
mys and my. 

Sometimes there are additional constraints that connect buying and selling activities of the firm 
of the form, 


(3) Dymyp + Dems = ba, 


where the matrices D, (h X 2), Ds (h X 2), and ba (h X 1) are given constant matrices, and D, and D, 
each containing at least one nonzero constant in each row. 

The only assumption that we shall make concerning the convex set defined by the constraints 
(2) and (3) is that there is at least one feasible vector satisfying them. Thus the constraint set may be 
unbounded. As we shall see, this may lead to a maximizing linear programming problem with unbounded 


solution without affecting our main results. Of course, in any reasonable application the solution to 
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such a maximizing problem will turn out to be bounded since unbounded solutions cannot be realized 
in practice. 

Many other kinds of constraints can be imagined. For instance, constraints (such as those in- 
volving liquidity or cash flow) over time, or constraints involving both state and control variables. 
However, we shall restrict this paper to the consideration of problems that involve only the constraints 
in the form of (2) and (3) and that there is at least one feasible vector satisfying them. 

As long as the constraints (2) and (3) are satisfied, the firm is permitted to be either long or short 
in any or all of the three asset types, cash, wheat, and rice. A long position is indicated by a positive 
stock of an asset and a short position by a negative stock. 

There are four factors that change the cash balance. One is the interest received on cash (or paid 
on a Joan) balance at a given constant rate R per unit period, which is to be accrued continuously and 
is to be received (or paid) in cash. That is, the interest on a cash or loan balance is compounded con- 
tinuously at interest rate R. The second factor is the inventory holding costs hi (w) for wheat and h2(v) 
for rice that must be paid in cash on inventories of sizes w(t) and v(t) at time t. The functions h; and 
hy are piecewise differentiable functions of w and v, respectively. They are increasing functions of 
w and v for their nonnegative values and decreasing functions for the nonpositive values. Thus we in- 
terpret the inventory holding cost to be the cost of being short in wheat or rice when w <0 or v <0, 
which increases as |w]| or |v| increases. The third factor changing the cash balance is the cost of pur- 
chasing wheat and/or rice, p;(t)mip(t) + P2(t)mep(t) at time t. The fourth factor is the revenue from 
sales of wheat and rice, p,(t)ms(t) + po(t)mee(t). Thus, the differential equation that expresses the 
manner in which the stock of cash changes is 


(4) é(t) =Re(t) — hi (w) —he(v) — pi (t) mip(t) — po (t) mep(t) + pi (t)mis(t) + po (t) mes(t). 


There are only two factors that affect the stock of wheat and rice, namely the purchase and sales 
quantities. Thus, the differential equations for w and v are 


(5) w(t) = mip(t) — mis (t) 
and 
(6) v(t) = mep(t) — mas (t). 


In order to simplify notation, we shall drop (t) from all expressions with the understanding that 
lower case letters (except b and e) represent functions of time unless other independent variables are 
specified (e.g., h(w)), and that all upper case letters represent constants. 

To put the problem into standard Mayer form, we differentiate (1) obtaining: 


(7) x=c+Piw+ pov=Re—h,(w) —hz(v) — pimp — pomep 
+ pigs + patties + Pytmp — Pymis+ Pomep — Parnas. 


The latter expression was obtained by using (4)-(6). 
The firm’s objective function is now expressed by 


(8) Max x(T) =c(T) 4+ Piw(T) + Pov (T) 
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subject to the differential equations (4)-(7) and the constraints (2) and (3). 
In order to solve the problem, we introduce adjoint variables Y(t) for (4), d(t) for (5), p(t) for 6), 
and the constant function —1 for (7). Then the Hamiltonian function is given by 


(9) H(t) = W(Re “= hi(w) —he (v) — PiMip — PoMap + Pimys + P2M25) +E b(mp = mis) 
+ p(mep = Mas) — (Re — hy (w) a he (v) — Pimp — P2M2zp + PiMis + P2Mes5 
+ Pimip — Pimis + Pomep — P2mzs) 


= (v — 1) (Re —— hi(w) as he(v)) — 4; (mip = ms) — 42(mezp we M25), 


where 4, and 42 are the switching functions defined as 


(10) 4,=P,+p,(V¥-1)—-¢ 
and 
(11) 42 = Py + po(W—1)—p. 


The determination of VY, @, and their interpretations together with those of #, 4, and 22 are given 
in Section 4. They are entirely analogous to the corresponding functions in Reference [1]. 

The maximum principle [2], provides as a necessary condition for an optimal policy that the 
control variables m should be chosen so as to minimize #, which implies the maximization of the 
criterion function 


(12) Z= 4)(Myp — mys) +-42(mzp — Mos) = 4(mp — Ms), 


subject to the control variable constraints (2) and (3). In other words, at each point of time, the optimal 
policy involves the solution of an ordinary linear programming problem. The coefficients of the objec- 
tive function in (12) are the switching functions 4; and 42. Since these are functions of time, it follows 
that what is involved is a parametric solution of this linear programming problem. We summarize our 
results in the following theorem. 

THEOREM: If the constraint set defined by (2) and (3) is nonempty, then the optimal control 


variables m are determined by solving at each time ¢ the linear programming problem: 


Max 4(myp — ms). 
Subject to 


Apmyp = bp, 

(13) Ams < bs 
Dypmp + Dgms = ba 

Mp,ms = 0, 


where 2= (41, .42) is given by (10) and (11). 
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PROOF: The calculations above show that necessary conditions for optimal control are that 
the control variables should solve at each point in time the linear programming problem (13). We must 
still show that these necessary conditions are sufhcient. The sufficiency follows immediately, however, 
by observing that the constraints of the linear programming problem can be assumed (by well-known 
results) to be nondegenerate so that the solution to the linear programming problem is unique, except 
for the finite number of values of 4, and 42, The example in the next section illustrates the result. 

It should be remarked that, if the solution to (13) is infinite, as may well happen for certain values 
of the switching function in the case that the constraint set is unbounded, our conclusion is still valid 
even though this gives an infinite value to the Hamiltonian. It may also happen that both mp, and m, 
become infinite while the quantity mp— mz, remains finite, so that the Hamiltonian remains finite. This 
case also causes no difficulty. Obviously neither of these possibilities will arise in a problem derived 
from an actual application since solutions involving infinite rates of buying cannot realistically be carried 
out over any finite length of time. 

There are two special cases worthy of mention and further detail. The first case is that of separable 
constraints which occurs when constraints of the form (3) are absent. Here the linear programming 
problem (13) breaks up into two smaller problems and hence it may be considerably easier to solve. 
The second case is that of coordinated control which occurs whenever my -ms= 0. In other words, a 
coordinated control problem is one in which there is never a reason to simultaneously buy and sell at 
a given time. Again, in realistic applications this is most likely to be the case because of the normal 
workings of the market. 


3. AN EXAMPLE 


In this section, we shall analyze the multiple product control problem discussed above by a simple 
numerical example. 

Let us suppose that the rates of inventory holding cost hi(w) and he(w) are specifically given as 
follows: 


(14) hi(w) =0.20 |w| 
(15) h2(v) =0.10 |v} 


The interest rate is 0.01. Both the rate of inventory holding cost and the interest rate are expressed 
as monthly rates. We shall consider 7=20 months for the planning period, and assume the terminal 
prices of wheat and rice to be $15 and $20, respectively. In this analysis, we shall also assume (for 
simplicity) that the signs of w and v are always positive, i.e., the firm never runs into a short position 
for the inventories. If the problem involves changes in the sign of w or v, the problem can be solved 
by the techniques described in our previous paper [1]. 


Using the result obtained in the previous paper, we can write the switching function 4 and 2 as 


(16) 4, = e?-9 (yu, — pi) 
and 
(17) 42=e%T-9(u,— pe), 


where 
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(18) ee [ae 4 P| Ree {w) 
and 

hs hi 
(19) l= [B+ Ps] emr-o EW). 


Substituting T= 20, Ai (w) =0.20, ho (v) =0.10, P; = 15, P,»=20, and R=0.01, we have 


(20) Uy = 35e79- 120-9 — 20 
and 
(21) l= 30e-9-01(20-) — JQ. 


In order to simplify the subsequent computations, let us assume the following values of p, and po. 


D1 = 35e79-01(20-9 — ¢ — 20 for 0<t<4 
(22) pPi=35e-9- 120-9 + ¢—28 = for 4¢< 14 
Pi = 35e-9-1020-9 — ¢ for 14<t =< 20 
p2= 30e-°-29-9 + ¢—-16 = for OX t= 12 
(23) P2= 30e~9-120-9 —¢+ 8 for 12<t<19 
Po = 30e~0-9120-9 + ¢ — 30 for 19<t<20. 


The numerical values at the end of each month based on (22) and (23) are given for p; and pz in Table 1. 


TABLE | 


The figures in Table 1 are only samples, since p; and pz must be specified for all values of t and not 


limited to its integer values in the continuous wheat and rice trading model. It is important here to 
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emphasize that the above functions for prices are selected only for the sake of simple illustrations and 
that the solution algorithm described in this paper can be applied to arbitrary price functions. 
From (22), (23) and (20), (21), we obtain the values of u; — pi, and uz — pe as follows: 


uj—pr=t for 0OSt<4 
(24) uj—pi=—8~—t for 451414 
uy— pi=t—20 for 14<4¢<20 


Ug — po=—t+6 for 0St¢<12 
(25) Ug— pe =t—18 for 12<=¢<19 
U2 — po=—t+20 for 19 =¢ =< 20. 


We now consider the constraints on the control variables. Suppose that the purchases constraints 
and the sales constraints are set up as follows: 


Mip +may <6. 
(26) Mp +2my <10 
3mipt+mep <= 12 
Mipy May =0 
ms tm <5 
(27) 3m, +2mos < 12 
Ms, Mo, 20. 


These constraints are shown in Figure I(a) and 1(b). Note that since (26) and (27) are the only constraints, 
this problem is a separable control problem. 
The solutions to the linear programming problems discussed earlier must take on one of the ex- 


treme points shown in the above figure or on a point of a convex combination of two adjacent extreme 


ms yy 
points. Which extreme point is selected depends upon the ratio rae For example, if = ], then (2, 4) 
2 1 


ab 
and (3,3) are equally preferred since 2.4; + 4.42,=3.4;+3.2. But if > 1 then (2, 4) becomes better. As <2 
1 1 
ty 
becomes larger and larger eventually (0, 5) is preferred to (2, 4) which occurs when =~ > 2. Similarly as 
1 
a, ‘ 4a, 1 
pmo (3, 3) is preferred to (2, 4) and asz< 3° (4, 0) becomes preferable to (3, 3). If 4; and 42 are both 
1 1 
; : i a ; as 
negative, the sales constraints become effective. If 5 > 1, (0, 5) is better than (2, 3), but as [~< 1 the 
1 1 


a 
latter is preferred until > < é at which time (4, 0) is preferred to (2, 3). 
1 


These may be summarized in Figure 2. The numbers in brackets show the optimum values of 
the control variables, the first number for mip or mis and the second number for mp or mos. If the 
number is positive, then that number is the value of mip or map, depending upon the element, and 
Mis OF Mag is set equal to zero. If the number is negative, the negative of that number is the value of 
M15 OF Mz,, depending upon the element, and mp or mep is set equal to zero. 
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2p 


Purchases 
Constraints 


(4,0) m. 
(a) 


lp 


Sales 
Constraints 


(4,0) m 
(b) 


1s 


FiGuRE 1 


Now we relate this analysis with the values of u;— pi; (i=1, 2) which obtained earlier. Since only 


u2— 


iy 
. ’ . . 2 2 . 
the ratio of 4, and 42 is crucial, and since ice oa we may use “;—p; and u2— pe in place of 
1 Ur—~Pi 


4; and 42. Then with the equations in (24) and (25) together with Figure 2, we obtain Figure 3. The broken 
line indicates the change in wu, — p; and u2 — p2 ast changes. The numbers represent the month. 


This figure completely specifies the optimum strategy. For example, during Month 2, = > 2, 
1” Pt 


U2— P2 


hence we should purchase only rice at the rate of 5 units per month. However, decreases over 


ui— Pr 
time and becomes equal to 1 at the end of Month 2, hence the strategy for Month 3 is to purchase 2 
units of wheat and 4 units of rice. Similarly, in Month 13 we should sell only rice at the rate of 5 units 
a month and Month 14 we should sell 2 units of wheat and 3 units of rice per month. 
During the last 2 months sales of wheat at the rate of 4 units a month and purchases of rice at the 


rate of 5 units a month are required. These are summarized in Table 2. 
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FIGURE 2 


TABLE 2 


Optimum monthly purchase or sales rate 


Months 

No trade Buy 5 units 
3 Buy 2 units Buy 4 units 
4,5 Buy 3 units Buy 3 units 

6 Buy 4 units No trade 
7,8 Buy 4 units Sell 5 units 
9-13 | No trade Sell 5 units 
14 Sell 2 units Sell 3 units 

15-18 | Sell 4 units No trade 
19, 20 | Sell 4 units Buy 5 units 

L ee 


In the above example, we designed the prices, for the sake of simple illustrations, in such a way 
that changes in the control variables occur only at the end of each month. Generally, this is not true, 
and mid-period corrections in the control variables are often required in continuous control problems. 
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(0,5) u 


(-4,5) 


yas 7 (4,-5) 


(0,-5) 


FIGURE 3 


4. INTERPRETATIONS OF THE OPTIMAL POLICY 


Since the derivations and interpretations of the formulas for the adjoint functions @ and p, the 
switching functions 4, and 42, and the Hamiltonian function # are virtually identical to those given in 
our previous paper [1], we shall only briefly discuss them here. 

To obtain the differential equation for V we calculate 


The solution to this equation with boundary condition ¥(T) = 0 is 
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(28) W(t) = 1 — e®(7-9, 


The interpretation of W(t) is that it is the net increase in value of a dollar invested from time t to T at 
interest rate R. 
The differential equations for @ and p are obtained as follows: 


(w) b=- = (w-)hy(w) 
; # 
p=— = (Whi). 


Substituting (28) into these equations and integrating, we obtain 


T 

(29) o={ hi (we? dr 
t 

and 
T 

(30) ie { hi (v)e®?-Mdr, 
t 


As in [1] the interpretation of (29) or (30) is the total time-adjusted unit cost of storing a small increment 
of wheat or rice from time t to T. 
Substituting these results into the switching functions (10) and (11) we have 


(31) = P,— pye®T-9— 
and 
(32) 45 = P, — preXT-0 — p, 


The interpretation of 4; is the future (time T) price of wheat, Py, less the future value of p,e®T-9 
of paying now (time t) for wheat at price p,, less the total storage cost @ from time t to T of the addi- 
tional wheat bought now. The interpretation of.4» is similar. 


Substituting the results into (9) gives the following form for the Hamiltonian: 
(33) — # = [Re — hi (w) — ho(v)) — a{ mp — msg). 


From this we can interpret the negative of the Hamiltonian as follows: the function —# is the future 
(time T) value of the present (time t) cash flow and consists of two parts: the first part is the cash 
flow due to previous (before time t) decisions, and the second part is the cash flow due to the present 
(time t) decisions. 
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It is obvious that we can extend these results to much more general problems with many more 
control variables, and more complicated constraints. We shall carry out such a generalization in a 
later paper in this series. 

One final interpretation is now possible concerning the dual variables to the maximizing linear 
programming problem (13). It is well known that the dual to a maximizing problem is a minimizing 
problem and that one of these problems has a finite solution if and only if the other does. The in- 
terpretation of the dual solution depends upon the specific units of the various quantities involved. 
As an example, assume that Ap, As, Dp, and D, in (13) are pure numbers. Since the units of the switch- 
ing functions 4, and 42 and measured in dollar per unit and m, and m, are in units per time period we 
conclude that bp, bs, and ba have units per time period also. Hence the dual variables to the linear 
programming problem (13), call them z,, . . . zm, have units of dollar per time period. This means 
that z,(¢) for example, measures the rate of contribution to the final asset position if we relax the 
righthand side of the first constraint by one unit now (time t). This is a rather remarkable interpre- 


tation for it evaluates an action now in terms of future values. 
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